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(54) Method and device for control and delivery of digitally compressed visual data in a 
heterogeneous communication network 

(57) A method and apparatus automattoally manip- 
ulates a digitally compressed vWeo bitstream such that 
the result is compatible with different bandwidth and 
error resilience requirements at a receiving point In a 
heterogeneous communication network. The method 
and devfce provide for analysis of bitstream header 
informatton to determine its relevant coding syntax 
parameters. Feedback from the receiving point on the 
network is then used to identify error robustness 
requirements for an intermediate bitstream. The bit- 
stream is then manipulated by an error robustness reg- 
ulator to achieve error protection in the network. The 
bitrate requirements of the network establish a selection 
of relevant and irrelevant bits from the bitstream The 
present method and device include identlficatton of irrel- 
evant and low priority video bits; intentional introduction 
of packet losses to reduce bitrate while maintaining vis- 
ual quality; use of replacement tags for irrelevant and 
low priority components of the bitstream for compatible 
decoding by a wide array of decoders; and insertion of 
resynchronization tags and translation of predictively 
coded video packet bits for the improvement of error 
resilience. 
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Description 

Field of the Invention 

[0001] The present invention relates to automati- 
cally managing the delivery of digitally compressed vis- 
ual data, and more particularly, to a device and method 
for controlling the bitrate and error robustness of digit- 
ally compressed video bitstreams in a heterogeneous 
communications network. 

Background o1 the Invention 

[0002] With the advent of new communication sys- 
tems, it has become possible to communicate digital 
visual information and data in addition to just voice serv- 
ices efficiently. Such new communication systems com- 
prise both wireline and wireless networks. In the context 
of wireline networks, packet and circuit switched com- 
munication systems have been devebped in both the 
private and public domains for wide and local area cov- 
erage. As part of packet-based communications, a sig- 
nificant amount of data is exchanged between local 
area networks, such as those which are part of a corpo- 
rate infrastmcture, and wide area networks, such as the 
public Internet Services to guarantee Quafity of Servtee 
(QoS) have emerged In this realm to support visual and 
multimedia communication. 

[0003] New wireless systems have also been devel- 
oped to support the transmission of information at 
higher bitrates than previously possible. These wireless 
systems conform to international cellular communica- 
tton standards, and consist of digital communteation air- 
interface standards using Time Division Multiple Access 
(TDMA) and Code Division Multiple Access (CDMA). An 
example of wkiely deployed TDMA communication is 
the Gbbal System for Mobile oommuntoation (GSM). 
CDMA forms the basis for the third generation cellular 
communication standard IMT-2000. 
[0004] Most of these communication systems (wire- 
line and wireless) have been designed to allow lor the 
exchange of a maximum amount of information 
between users connected within the Infrastructure of 
each system and those using devices outsMe that infra- 
structure. For example, IMT-2000 protocols can support 
data communication from wireless terminals to termi- 
nals residing on packet switched wireline systems. 
Moreover, within these communteation systems, termi- 
nals, or clients, may connect to other terminals, acting 
as sen/ers, at different bitrates depending on their loca- 
tton in each system. This leads to the concept of heter- 
ogeneous communication networks, which are 
networks consisting of diverse or dissimilar connections 
yet often communicating similar informatton. 
[0005] Problems arise when considering the com- 
municatton of visual information in heterogeneous net- 
works. A first problematic issue which arises is that 
visual information, and, in particular, digitally com- 



pressed vkleo signals, require substantially more band- 
width than comparable voice signals compressed over 
similar time intervals. Transmitting visual information 
between systems designed with different nominal trans- 

5 misston bitrates presents some difficulty because of 
delays that result from these incompatible bitrates. In 
addition, varying en^or characteristtes of the different 
communication systems in a heterogeneous network 
also present a problem for the transmission of visual 

w information because it is very diffteult to mask errors in 
the visual space. 

[0006] These problems do not alone make up the 
difficulties with communicating visual information over 
diverse systems. Services designed to exploit the com- 

15 munication of visual information In heterogeneous net- 
works must, necessarily, rely on the existence of 
digitally compressed video. Therefore, not only creating 
new content, but exploiting existing content is a primary 
focus for the further development and success of these 

20 services. For one-way visual . applicat'ions, there is 
already a substantial amount of digitally encoded con- 
tent for training and entertainment that could be deliv- 
ered to mobile subscribers on a wireless network, or to 
clients connected at dissimilar rates on a wireline net- 

25 work. Existing, or "legacy" vWeo material is primarily 
encoded using the ISO MPEG-1 and MPEG-2 stand- 
ards, although legacy content also exists in the form of 
the ITU-T H.261 and, to a lesser extent, H.263 stand- 
ards. This material can have high value to new multime- 

30 dia services, while at the same time lead to the 
acceptance and use of new visual coding standards 
such as ISO MPEG-4. 

[0007] One problem with the delivery of legacy data 
over the low bandwidth wired and wireless networks is 

35 that the original encoded bandwidth typfeally exceeds 
the capabilities of ttie network, assuming that there is a 
maximum delay restriction on ttie delivery of that data. 
for example, some wireless mobile systems may have 
multimedia terminals supporting MPEG-1 decoding, but 

40 ttie channel rates to the multimedia enabled mobile ter- 
minals may be as low as 64Kbps. In this case, the 
MPEG-1 material, encoded at 1.5Mbps, would require 
substantial buffering delay time before playback couki 
begin. Unless most of the sequence was to l>e down- 

45 toaded first, stalling would occur, as the decoder's buffer 
would empty much faster than it would fill. It is also 
unlikely that a mobile terminal would be able to provide 
suffbient memory for buffering long sequences. A 
requirement exists, therefore, to successfully manipu- 

50 late this type of high bitrate data to a bwer rate such 
that it is compatible with a low bandwWtti connection on 
the network. 

[0008] As a result of the arrival of new wired and 
wireless communicatk>n systems with the capacity to 
55 transmit and exchange visual information, there is a sig- 
nificant need for a method and device designed to man- 
age the visual content being delivered over these 
networks. In particular, technology is needed to auto- 
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malically manage the bitrate and error robustness of 
pre-existing digital video bitstreams so that they can be 
delivered in a compatible form to users at arbitrary 
nodes in a heterogeneous network. 

Summary of the Invention 

[0009] Briefly, therefore, this invention provides a 
method and apparatus for changing a digitally com- 
pressed video bitstream at a source point in a network, 
in an automatic way, such that the resulting bitstream is 
compatible with different bandwidth and error resilience 
requirements at a receiving point in a heterogeneous 
communication network. The novel method consists of 
analyzing the header information of the original bit- 
stream to determine its coding parameters. Feedback 
from the receiving point on the network is then used to 
determine error properties of the network. The band- 
width and enror resilience requirements establish a pri- 
oritization and selection of relevant and irrelevant bits 
from the original bitstream, of which the relevant bits are 
manipulated by a robustness regulator to achieve error 
protectton In the network. The principal inventive com- 
ponents of the method include identification of in-elevant 
and low priority video bits, intentional introduction of 
packet losses to reduce bitrate, use of replacement tags 
for irrelevant and tow priority components of the bit- 
stream, and insertton of resynchronization tags for the 
improvement of en-or resilience. 

Brief Descri ption of the Drawings 
[0010] 

FIG. 1 is a block diagram of a bitstream regulation 
system operative to manipulate an incoming bit- 
stream and output a bitstream compatible ¥nth a 
receiving terminal in accordance with an embodi- 
ment of the present invention. 
FIG. 2 is a generalized flow diagram of a preferred 
embodiment of a method for regulating a bitstream 
encoded with scalable Informatton to produce a bit- 
stream compatible with a receiving terminal in 
accordance with the present invention. 
FIGS. 3A, 3B and 3C lllustrativety provide examples 
of insertion of resynchronization tags into a bit- 
stream and translation of differentially encoded 
components within a video packet to supportrobust 
decoding. 

FIGS. 4A and 48 illustratively provide examples of 
insertion of replacement tags into a bitstream after 
bitrate regulation to preserve the decoded quality at 
a compatible receiving terminal. 
FIG. 5 illustrates a block diagram of a preferred 
embodiment of a device for manipulating bits in a 
bitstream of encoded video data to produce a bit- 
stream compatible with a receiving terminal in 
accordance with the present invention. 



Detailed Description of the Preferred Embodiments 

[001 1] The present invention is more fully described 
with reference to FIGS. 1 - 5. FIG. 1 illustrates a block 

5 diagram 100 of a visual bitstream regulation method 
which defines a control system operated in the manner 
described herein. The input bitstream 101 is passed 
through a header analysis stage 103 which extracts 
information about the type of standard used to create 

10 the bitstream. This information is passed through the 
standard type identification stage 105, which results in 
the assignment of control parameters in the main bit- 
stream control 107 which is coupled to accept input 
parameters describing the receiving terminal's capabtli- 

15 ties 1 09. These steps are designed to set up the system 
for proper regulation and compatible delivery of the bit- 
stream from the sender to the receiving terminal. In par- 
ticular, Wentification of the original visual coding 
standard used to encode the bitstream coupled with 

20 information about the capabilities of the receiving termi- 
nal determines the extent to which the bitstream must 
be manipulated before compatible communication is 
possible. 

[0012] The output of the main bitstream control step 

25 IS coupled to the robustness regulation stage 111. In 
this stage, the bitrate-regulated bitstream is modified to 
provide a bitstream which is compatible with the error 
characteristics of the channel connecting the source 
and receiving terminals. This robustness regulation is 

30 dependent on the standard type of the original bitstream 
identified in 105. The regulation is accomplished 
through the packetizatlon of bits in the coded frames of 
visual information in such a way that errors may be 
detected, tocalized and concealed by a robust decoder. 

35 The use of resynchronization tags 1 1 3 is applied for vis- 
ual bitstreams conlbrming to standards which support 
sub-frame resynchronization. The robustness regula- 
tion step manipulates the bitstream such that independ- 
ent decoding is possible for each segment of bits 

40 delimited by the inserted resynchronization tags. In the 
case of predictively coded information, this information 
is replaced with equivalent information utilizing only pre- 
dictors within the same segment This stage is referred 
to as vkjeo packet translation 115. For example, this 

45 would apply to the replacement of codewords for motion 
vector differences computed across resynchronization 
segments with motion vector differences computed from 
information in the same segment. 
[0013] The output of the video packet translation 

50 step is directed to the bitrate regulation stage 1 1 9 which 
is coupled to receive intentional packet loss information 
117 and, additionally, to discard irrelevant bits 121 in 
order to reduce the bitstream bitrate. The purpose of the 
bitrate regulation stage is to adjust the bitrate of the 

55 input bitstream so that the delivered stream is compati- 
ble with the capabilities of the receiving terminal and its 
delivery channel. This is accomplished through manipu- 
lation and removal of bits in the incoming bitstream. The 
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present method does not provide a mechanism for 
explicit decoding and re-encoding of bitstreams, also 
known as transcoding, but rather supports translation of 
codewords In the compressed domain of the bitstream. 
The bitrate regulator selectively removes Information 5 
from the Incoming bitstream so that the output bitstream 
conform to a compatible delivery bitrate. Intentional 
packet losses 117 provide a mechanism to reduce the 
bitrate below the lowest base layer bitrate of the original 
Incoming bitstream. The term base layer is used to w 
describe the towest priority independently decodable 
layer of a scalable bitstream. When selected packets of 
bits are dropped, the truncated bitstream can still be 
effectively decoded without substantial quality degrada- 
tion by a robust decoder at the receiving terminal. 15 
[0014] The use of replacement tags 123 Inserted 
into the bitstream enables a decoder to seamlessly 
decode a bitrate-regulated bitstream without additional 
capabilities over those of the nominal relevant standard 
decoder. An example of this is the case where a stand- 20 
ard such as MPEG-2 requires a pre-specified coded 
frame rate, such as 30 frames per second, but where 
frames are dropped by the bitrate regulator. In this 
instance, the replacement tags are predetermined 
binary codewords which represent empty frames, and 25 
which preserve the time relationships of coded frames 
in the bitstream. 

[0015] Finally, the manipulated incoming bitstream 
is delivered to the heterogeneous network and subse- 
quently to the receiving terminal. The inventive method 30 
100 outlined above provides a solution to the problem of 
delivering standardized digitally compressed visual 
information from a variety of sources to receiving termi- 
nals with dissimilar bandwidth and error robustness 
capabilities. The overall method supports all classes of 3S 
existing visual coding standards by adapting to the spe- 
cific syntax elements of each, and utilizing each stage of 
the present method for Its purpose only to the degree to 
which it still produces a bitstream which conforms to 
that standard, ^ 
[001 6] FIG. 2 illustrates a generalized f tow diagram 
200 of a preferred embodiment of a method for regulat- 
ing a bitstream encoded with prioritized frame types 
such as INTRA (1), PREDICTED (P), and 
BLDIRECTIONALLY PREDICTED (B) frames, and/or 45 
scalable frame types including temporal, spatial, and 
SNR scalability frames. Examples of voluntary interna- 
tional standards for the bitstreams having 1, P. and 8 
frames include MPEG-1 , MPEG-2. MPEG-4. and H.263 
v2. Examples of standards for bitstreams having scala- so 
bllity frames include MPEG-2. H.263 v2, and MPEG-4. 
In all cases, the preferred embodiment 200 defines a 
method for manipulating a standards compliant bit- 
stream with multiple frame types in order to regulate the 
bitrate and error robustness of that bitstream before ss 
delivery over a heterogeneous network. 
[0017] The preferred embodiment of the present 
invention is shown in tiie flow diagram 200 with respect 



to an incoming MPEG-4 bitstream 201 with two layers: 
a base layer 209 and a temporal scalability enhance- 
ment layer 21 1. The temporal scalability enhancement 
layer is a syntax element supported by the MPEG-4 
standard, and consists of multiple frames of video which 
are temporally interleaved with the video frames of the 
base layer. A decoder can decode all, some, or no 
frames of the temporal scalability layer without affecting 
the nominal quality of the decoded base layer. This is a 
result of the fact that the scalability layer is not used for 
the prediction of any informatfon at an equal or lower 
layer of scalability in the syntax of MPEG-4. 
[0018] The incoming bitstream 201 is analyzed as 
shown at 203 with the input of a predetermined list of 
standard header codes 204. The analysis exhaustively 
compares tiie list of standard header codes with the rel- 
evant leading bits in the incoming bitstream to deter- 
mine the standard and syntax used to compress the 
bitsti^eam. The bitstream is then demultiplexed 207 to 
provkJe two independent bitstreams, a base layer 209 
and a temporal scalability layer 211, Parameters defin- 
ing the relationship and type of tiie base and enhance- 
ment layers are extracted from the incoming bitstream 
205 and passed to the main bitstream control stage 213 
of this preferred embodiment The independent bit- 
streams 209 and 21 1 are jointly processed by the main 
bitstream control stage. The main bitstream control 
stage is also coupled to receive control signals from the 
network 215 whfch describe the capabilities of the 
receiving terminal, kJentified as "Recv. Caps" in 215. In 
parttoular, these control signals indicate tiie desired 
bandwkjth to be transmitted over tiie network and the 
location of the receiver terminal on the network. This 
location information describes the channel state of the 
receiving terminal and indicates whether it is connected 
through a wired or wireless branch of the network. 
[0019] The main bitstream control 213 passes the 
bitstreams to the robustness regulation stage 217. The 
error resilience of the input data is adjusted so that tiie 
effect of errors is mitigated in the decoder at the receiv- 
ing terminal. In the preferred emkx>diment described by 
the flow diagram 200 in FIG.2, this is accomplished by 
the insertion of resync markers 219 into the base layer 
bitstream 209, a syntax element of MPEG-4 which per- 
mits the localization of en-ors in a robust decoder. With 
the insertion of tiiese markers, it can be necessary to 
adjust the codewords in the bitstream so that predic- 
tivety coded Information does not require values from 
independent vWeo packets, which are spatial regions 
represented by bits contained in segments delimited by 
different sets of resync markers. This is accommodated 
by the translation of differentially encoded codes 221. 
explained In more detail below. 
[0020] FIGS. 3A. 38 and 3C illustratively provide 
examples of insertion of resynchronizatton tags into a 
bitstream and translation of differentially encoded com- 
ponents within a video packet to support robust decod- 
ing. In FIG. 3A, an illustration 301 is shown of an original 
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video frame bitstream of the highest level of syntax ele- 
ments of a video frame having no sub-frame resynchro- 
nization. Such a bitstream has poor error resilience in 
the presence of channel errors because all of the infor- 
mation is predictively coded within the coded macrob- 
lock data portion of the bitstream. When errors are 
introduced, it is impossible to continue decoding until 
the next unique video frame header is encountered. 
This poor localization of errors is remedied by the 
present Inventbn as illustrated in FIG. 3B and generally 
designated at 303. In the illustration 303, a bitstream 
with video packet resynchronization is shown. In this bit- 
stream, the inserted resynchronization tags enable a 
decoder to localize errors to smaller portions of the bit- 
stream, and consequently produce a greater level of 
quality at a receiving terminal in an error prone hetero- 
geneous communications network. The method for 
translating predictively coded codewords within the 
newly formed video packets is illustrated at 305 in FIG. 
30 with respect to the most common predictively coded 
syntax element: motion vectors. As seen in the enlarged 
detail view In FIG. 3C, the motion vector codeword at 
macroblock X is computed as the difference of the 
motion vector at X and a function of those at neighbor- 
ing macroblock positions A, B, and C. When a resyn- 
chronization tag is inserted as described with reference 
to FIG. 38, a new video packet boundary for predictive 
codes arises. This boundary prevents the corruption of 
the cun-ent video packet in the event that any neighbor- 
ing video packet is con-upted due to eaors. Thus, the 
motK>n vector codeword, refen-ed to as a motion vector 
difference (MVD) is computed only with respect to the 
nearest macroblock residing within the same video 
packet This Is shown in FIG. 3C as MVX'f{MVCl 
instead of MVX'f(MVAMVBMVCl where ^(a) is a 
function defined by the syntax of the relevant video 
standard, and usually defined as the median operator. 
The method of localizing predictive codes includes 
parsing and kjentifying only the present codewords 
which use predictive coding in the relevant standard, 
and translating those to the appropriate symbols for the 
localized predictive codewords. In the context of MPEG- 
4, this applies to simple translatk)n of motion vector dif- 
ference codewords, as illustratively shown at 303 in 
FIG. 3C, and in the same fashion to any predictively 
coded texture data including quantization parameters 
and DCT coefficient values. 

[0021] After the robustness regulation stage 217, 
the nrK>dified bitstream data, and those components of 
the bitstreams not modified by steps 219 and 221 are 
delivered to the bitrate regulation stage 223, Two tech- 
niques are used to achieve the desired bitrate, which is 
lower than the bitrate of the incoming bitstream. The 
first approach to the reductbn of bitrate is through the 
discarding of bits associated with tow priority frames in 
the bitstream. Bits associated with temporal scalability 
frames represent the lowest priority bits and are dis- 
carded from the incoming bitstream first as shown at 



227. The base layer bitstream represents the lowest 
bitrate bitstream achievable without directly impacting 
the performance of a typical decoder. Intentional packet 
losses are then introduced through packet loss indicator 

5 flags 225 coupled to the bitrate regulation stage 223. 
These packet toss indicator flags 225 are used to dis- 
card bits 229 from the base layer bitstream resulting in 
an output bitstream which has reduced bitrate. 
[0022] The introductton of intentional packet tosses 

10 represents a key novel element of this preferred embod- 
iment. The particular means to Intentionally introduce 
packet tosses to the base layer bitstream uses a prede- 
termined dropping scheme. A predetermined scheme 
for intentionally dropping packets is to remove the bits 

IS associated with non-contiguous togical resynchroniza- 
tion segments of each vWeo frame, known In MPEG-4 
as video packets, at uniform spacing until the desired 
bitrate reduction has been achieved. When a packet is 
dropped from a frame, a packet which does not overlap 

>o the same spatial regton in the next frame is dropped. 
This prevents the serious degradatton of any single 
regton of the scene. This is accompfished by keeping 
historical information about the macroblock numbers in 
each dropped packet Packets t>eginning with macrob- 

25 lock numbers overlapping the macroblock range of the 
last dropped packet are not dropped. The first sequen- 
tial non-overlapping packet in the next frame is the first 
candidate for dropping according to this method. This 
use of statistical information in the introduction of packet 

30 losses is integral in reducing the bitrate without substan- 
tially degrading the video quality. 
[0023] When 'data partitioning" has been used 
within a video packet of the incoming bitstream, only the 
texture coding components of the video packet are 

35 dropped in ttiis prefen'ed embodiment When data parti- 
tioning has not been used within a video packet of the 
incoming bitstream. the combined motion and texture 
components of the video packet are dropped In this pre- 
ferred embodiment When sub-frame resynchronization 

40 markers have not t>een used in the original bitstream, or 
inserted by the rot)ustness regulatton stage, entire 
frames of video are dropped by this method. When 
dropping entire frames,^ B frames are the lowest priority 
and dropped first, followed by P and tiien I frames. P 

45 Frames which are furthest in time from a previous I 
frame are dropped prior to other P frames. This is 
because the likelihood of new I frames appearing in the 
bitstream for resynchronizing the scene increases in 
time from the last coded I frame. The use of this stage 

so of the method is enhanced by the existence of a rot)ust 
decoder at the receiving terminal 239. Though not 
required for the successful implementation of the 
present invention, such a terminal will conceal the 
errors introduced by dropping information as indicated 

55 by 223, 

[0024] If the receiving terminal capabilities indicate 
that the video decoder is not a robust decoder, it is nec- 
essary to insert replacement information into that bit- 
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stream in order to guarantee accurate decoding at a 
receiving terminal. The introduction of replacement 
codeword bits 231 accomplishes this task in the next 
step of the method. In a preferred embodiment as 
shown by the flow diagram 200, resynchronization 
packets are dropped in step 223. FIGS. 4A and 4B fur- 
ther describe the Inserting of replacement information 
for two scenarios of dropped MPEG-4 video packet 
Information in accordance with this prefen-ed embodi- 
ment. In the first example 401 shown in FIG. 4A, bits 
associated with the texture coding part of a video packet 
are dropped by the bitrate regulator 217 in FIG. 2. Por- 
tk)n 401a of the first example identified an original data 
partitioned video packet bitstream; and portion 401b 
represents video packet bitstream with replacement 
tags. The bits dropped, which are delimited by a unk^ue 
motion marker, are associated vkh the spatial predic- 
tfon error codewords (CBPY, DO, DCT), which decode 
to values applied to a spatial portton of a vkieo frame 
after motion compensation has been applied. This Infor- 
mation is the lowest priority component of an MPEG-4 
video packet. While this example illustratively provkJes 
details for the syntax of MPEG-4, the Wentteal concept 
applies to the video syntax of the MPEG-2 standard. In 
example 401, replacement bits are inserted into the bit- 
stream in the form of code words for zero valued predk:- 
tton error coefficients (CBPr.DQ'.DCT). These 
codewords require negligible bitrate in relation to the 
bitrate consumed by the dropped bits. A more extreme 
case of the same concept is illustrated In example 403 
shown in FIG. 48, where data partittoning is not present 
in the incoming bitstream. Portion 403a of FIG. 48 rep- 
resents an original combined motion texture vkjeo 
packet bitstream; and portion 403b represents vWeo 
packet bitstream with replacement tags. In this case, all 
of the bits associated with the payload of the vkdeo 
packet are dropped. The vUeo packet header remains 
and is used to indicate the bounding macroblock infor- 
matton Ibr the video packet The replacement bits used 
are the codewords for combined motion and texture 
information for blocks containing no motion and no pre- 
dkJtion error information (CBP'). This is referred to as 
"NOT CODED" codewords in example 403. Again, as in 
example 401 , the bitrate consumed by the replacement 
bits is negligible in comparison to the bitrate consumed 
by the original video packet under conskieration. 
[0025] Continuing with the description of the pre- 
ferred embodiment of the present method described by 
FIG 2, a bitrate regulated bitstream is output at 233 from 
the bitrate regulation stage. The output bitstream is 
delivered to the heterogeneous network 237 for subse- 
quent decoding at the receiving terminal 239. 
[0026] FIG. 5 illustrates a diagram of a device 500 
for manipulating a digitally compressed video bitstream 
to produce an output bitstream compatible with the 
bandwidth and error resilience requirements of hetero- 
geneous communications network. The device 500 
comprises a header analysis unit for determining a 
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standard type and a syntax used to compress the input 
bitstream 501 coupled to a main bitstream controller 
505. A header analysis unit 503 is coupled to receive 
input from a memory module holding a list of predeter- 

5 mined standard header codes 504 enabling the analysis 
unit to compute the present standard and syntax by 
exhaustive matching of header bits. The main bitstream 
controller 505 is a means for discarding selected bits 
from the Input bitstream on the basis of the syntax 

10 determined in the unit 503. The main bitstream control- 
ler 505 comprises a rot)UStness regulation module 509, 
a bitrate regulation module 511, and a replacement tag 
generator 51 5. 

[0027] The robustness regulation module 509 is a 
IS means for inserting predetermined bits into the bit- 
stream to improve the en-or robust tocalization and 
decoding of the bitstream In accordance with the stand- 
ard type and syntax determined in the unit 503. The 
robustness regulation nrK>duie 509 also translates any 
20 predictively coded Information into codewords relying 
only on bits within the present vkdeo packet in accord- 
ance with inserted resynchronization markers. Opera- 
tion of the robustness reguiatton module 509 is reliant 
on signals from the header analysis unit 503 and feed- 
25 back from the receiving terminal 507, which are both 
coupled to input to the main bitstream controller 505 
containing those modules. 

[0028] The output of the robustness regulation 
module 509 is coupled to input to the bitrate regulation 

30 module 51 1 , whteh is a means to discard selected bits 
513 from the bitstream on the basis of the standard type 
and the syntax determined in the unit 503. The bitrate 
regulation module 509 Is a means for dropping low pri- 
ority bits associated with frames in an enhancement 

35 layer of a scalable bitstream, and means for discarding 
packets of low priority bits associated with segments in 
a packet of video bits delimited by resynchronizatton 
markers or picture headers. The bitrate regulation mod- 
ule 509 also contains memory for recording historical 

40 information of the packets dropped in a previous frame 
which is used to compute the next packets to drop. This 
enables avoiding repetitive removal of identical spatial 
areas of a scene. 

[0029] The bitrate regulation module 51 1 produces 
45 an output which is compatible with the capabilities of a 
receiving terminal as indicated by the feedback signal 
507 coupled to input to the main bitstream controller 
505. The output of the bitrate regulation module 51 1 is 
coupled to the input of the replacement tag generator 
50 515, which is a means for inserting predetermined bits 
to improve the decoding of the bitstream in accordance 
with the standard type and syntax determined in the unit 
503. In particular, the replacement tag generator 515 
replaces bits dropped by the bitrate regulation module 
55 509 with short code words designed to preserve timing 
and spatial Information in any decoder such that it will 
not suffer from unrecoverable errors, 
[0030] The output of the main bitstream controller 
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505 is coupled to a heterogeneous network 519, which 
in turn delivers a resultant bitstream 517 to a robust 
decoder at a receiving terminal 521, The device 500 is 
capable of manipulating bitstreams generated by any 
video coding standard. Depending on the degree of 5 
flexibility of the particular standard, varying degrees of 
functionality in terms of bitrate regulation and error 
robustness are achieved. In general, the most recent 
standards, H.263 and MPEG-4, can provide for the 
most fine control of these bitstream attributes by the 10 
device 500. 

[0031] Although exemplary embodiments are 
described above, it will be obvious to those skilled in the 
art that many modifications and alterations to this 
method and device for bitstream control and delivery is 
may be made without departing from the invention. 
Accordingly, it is intended that all such modificatbns 
and alterations be included within the spirit and scope of 
the invention as defined in the following claims. 

20 

Claims 

1. A method for manipulating bits in a bitstream of 
encoded video data, said method comprising the 
steps of: 25 

determining a standard type and a syntax used 
to compress the bitstream; 
inserting predetermined bits into the bitstream 
and translating predictive ly coded information 30 
resulting in an intermediate bitstream with 
improvement of error robust localization and 
decoding of the bitstream in accordance with 
the standard type and the syntax; and 
discarding selected bits from the intermediate 35 
bitstream based on the standard type and the 
syntax used to compress the bitstream result- 
ing in an output bitstream having a bitrate com- 
patible with that of a robust receiving terminal. 

40 

2. The method of Claim 1 wherein the step of deter- 
mining standard type and a syntax used to com- 
press the bitstream further comprises the step of: 

reading header bits; and 45 
comparing said bits to a predetermined list of 
standard headers to identify a present standard 
type and syntax used to compress the bit- 
stream. 

50 

3. The method of Claim 1 wherein the step of inserting 
predetermined bits and translating predictive ly 
coded information further comprises the step of: 

inserting resynchronization markers; and 55 
translating differentially encoded motion vec- 
tors and associated texture data parameters to 
limit predictive coding to within a video packet. 



4. The method of Claim 1 wherein said step of dis- 
carding selected bits further comprises the step of: 

discarding low priority bits associated with 
frames in an enhancement layer of a scalable 
bitstream. 

5. The method of Claim 1 wherein said step of dis- 
carding selected bits further comprises the step of: 

discarding packets of low priority bits associ- 
ated with segments in a packet of video bits 
delimited by resynchronization markers or pic- 
ture headers. 

6. The method of Claim 5 wherein the step of discard- 
ing packets of low prtority bits further comprises the 
step of: 

selecting packets to discard on the basis of his- 
torical information of packets dropped In a pre- 
vious frame to avokJ repetitive removal of 
identical spatial areas of a scene. 

7. A device for manipulating bits in a bitstream of 
encoded vkieo data, said device comprising: 

means for determining a standard type and a 
syntax used to compress the bitstream; 
means for inserting predetermined bits into the 
bitstream and translating predictively coded 
informatfon resulting in an Intermediate bit- 
stream with improvement of en-or robust focaii- 
zation and decoding of the bitstream in 
accordance with the standard type and the syn- 
tax; and 

means for discarding selected bits from the 
intermediate bitstream based on the standard 
type and ttie syntax used to compress the bit- 
stream resulting in an output bitstream having 
a bitrate compatible with tfiat of a robust receh^- 
Ing terminal. 

8. The device of Claim 7 wherein said means for 
determining a standard type and a syntax used to 
compress the bitstream further comprises: 

means for reading header bits and comparing 
said bits to a predetermined list of standard 
headers to identify a present standard type and 
syntax used to compress the bitstream. 

9. The device of Claim 7 whereirTsaid means for 
inserting predetermined bits and translating predic- 
tively coded information further comprises: 

means for inserting resynchronization markers 
and translating differentially encoded motion 
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vectors and associated texture data parame- 
ters to limit predictive coding to witiiin a video 
packet. 

10. Tiie device of Claim 7 wherein said means for dis- s 
carding selected bits further comprising: 

; means for discarding low priority bits associ- 
ated with frames in an enhancement layer of a 
scalable bitstream. w 

11. The device of Claim 7 wherein said means for dis- 
carding selected bits further comprising: 

means for discarding packets of low priority bits is 
associated with segments in a packet of video 
bits delimited by resynchronization markers or 
picture headers. 

12. The device of Claim 1 1 wherein the means for dis- 20 
carding packets of low prfority bits further com- 
prises: 

means for selecting packets to discard based 
on historical information of packets dropped In 25 
a previous frame to avoid repetitive removal of 
klentical spatial areas of a scene. 
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